Overview

Dataset statistics

Number of variables: 25
Number of observations: 16554
Missing cells: 26047
Missing cells (%): 6.3%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.6 MiB
Average record size in memory: 164.0 B
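
The missing-cell percentage follows from the counts above: 25 variables times 16554 observations gives the total number of cells, of which 26047 are missing. A minimal sketch of that arithmetic (the variable names are illustrative and not part of the report):

```python
# Figures copied from the dataset statistics above.
n_variables = 25
n_observations = 16554
missing_cells = 26047

# Every observation has one cell per variable.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(total_cells)            # 413850
print(round(missing_pct, 1))  # 6.3
```

The 26047 missing cells are fully accounted for by the two columns the report flags as Missing: 10142 in sqft_basement plus 15905 in yr_renovated.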

Variable types

Numeric: 16
Categorical: 9

Alerts

grade is highly correlated with bathrooms and 5 other fields (High correlation)
sqft_basement is highly correlated with bedrooms and 4 other fields (High correlation)
bathrooms is highly correlated with grade and 4 other fields (High correlation)
bedrooms is highly correlated with grade and 4 other fields (High correlation)
sqft_above is highly correlated with grade and 9 other fields (High correlation)
sqft_living15 is highly correlated with grade and 4 other fields (High correlation)
floors is highly correlated with sqft_above and 2 other fields (High correlation)
yr_renovated is highly correlated with jhygtf (High correlation)
yr_built is highly correlated with zipcode and 6 other fields (High correlation)
jhygtf is highly correlated with fue_renovada (High correlation)
sqft_lot is highly correlated with sqft_lot15 (High correlation)
price is highly correlated with grade and 3 other fields (High correlation)
sqft_lot15 is highly correlated with zipcode and 3 other fields (High correlation)
sqft_living is highly correlated with grade and 6 other fields (High correlation)
fue_renovada is highly correlated with jhygtf (High correlation)
antiguedad_venta is highly correlated with zipcode and 6 other fields (High correlation)
view is highly correlated with waterfront (High correlation)
waterfront is highly correlated with view (High correlation)
zipcode is highly correlated with yr_built and 2 other fields (High correlation)
condition is highly correlated with yr_built and 1 other field (High correlation)
sqft_basement has 10142 (61.3%) missing values (Missing)
yr_renovated has 15905 (96.1%) missing values (Missing)
df_index has unique values (Unique)
jhygtf has 15905 (96.1%) zeros (Zeros)
antiguedad_venta has 320 (1.9%) zeros (Zeros)

Reproduction

Analysis started: 2022-10-04 04:35:39.353949
Analysis finished: 2022-10-04 04:36:33.134697
Duration: 53.78 seconds
Software version: pandas-profiling v3.3.0
Download configuration: config.json
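
The reported duration is the difference between the two timestamps above. A short sketch using only the Python standard library (the format string is an assumption about how the timestamps are laid out, inferred from their appearance):

```python
from datetime import datetime

# Layout inferred from the timestamps as printed in the report.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2022-10-04 04:35:39.353949", FMT)
finished = datetime.strptime("2022-10-04 04:36:33.134697", FMT)

duration_seconds = (finished - started).total_seconds()
print(round(duration_seconds, 2))  # 53.78
```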

Variables

df_index
Real number (ℝ≥0)

UNIQUE

Distinct: 16554
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 18811.53262
Minimum: 1
Maximum: 113866
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 129.5 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1130.65
Q1: 6188.5
Median: 14556.5
Q3: 27262.5
95-th percentile: 51274.25
Maximum: 113866
Range: 113865
Interquartile range (IQR): 21074

Descriptive statistics

Standard deviation: 16147.10472
Coefficient of variation (CV): 0.8583619978
Kurtosis: 1.628791243
Mean: 18811.53262
Median Absolute Deviation (MAD): 9673.5
Skewness: 1.265459165
Sum: 311406111
Variance: 260728990.9
Monotonicity: Not monotonic
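
Several of the df_index figures are derived from others: the range from the minimum and maximum, the IQR from the quartiles, the coefficient of variation from the standard deviation and mean, and the variance from the standard deviation. A minimal sketch that recomputes them from the reported values (inputs are rounded as printed in the report, so the last digits may differ slightly):

```python
# Values copied from the df_index statistics.
minimum, maximum = 1, 113866
q1, q3 = 6188.5, 27262.5
mean, std = 18811.53262, 16147.10472

value_range = maximum - minimum   # spread between the extremes
iqr = q3 - q1                     # spread of the middle 50%
cv = std / mean                   # standard deviation relative to the mean
variance = std ** 2               # square of the standard deviation

print(value_range)   # 113865
print(iqr)           # 21074.0
print(round(cv, 4))  # 0.8584
print(variance)      # close to the reported 260728990.9
```

The same relationships hold for the other numeric variables in the report.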
Histogram with fixed size bins (bins=50)
Value                  Count   Frequency (%)
19857                  1       < 0.1%
41861                  1       < 0.1%
27591                  1       < 0.1%
24402                  1       < 0.1%
5679                   1       < 0.1%
51523                  1       < 0.1%
23241                  1       < 0.1%
6184                   1       < 0.1%
22233                  1       < 0.1%
6765                   1       < 0.1%
Other values (16544)   16544   99.9%
Value   Count   Frequency (%)
1       1       < 0.1%
2       1       < 0.1%
3       1       < 0.1%
5       1       < 0.1%
8       1       < 0.1%
9       1       < 0.1%
10      1       < 0.1%
12      1       < 0.1%
14      1       < 0.1%
15      1       < 0.1%
Value    Count   Frequency (%)
113866   1       < 0.1%
111906   1       < 0.1%
109571   1       < 0.1%
108311   1       < 0.1%
99297    1       < 0.1%
98195    1       < 0.1%
98015    1       < 0.1%
94768    1       < 0.1%
94083    1       < 0.1%
93739    1       < 0.1%

zipcode
Real number (ℝ≥0)

HIGH CORRELATION

Distinct70
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean98079.01027
Minimum98001
Maximum98199
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size64.8 KiB

Quantile statistics

Minimum98001
5-th percentile98004
Q198033
median98070
Q398118
95-th percentile98177
Maximum98199
Range198
Interquartile range (IQR)85

Descriptive statistics

Standard deviation53.58031141
Coefficient of variation (CV)0.0005462974317
Kurtosis-0.88492992
Mean98079.01027
Median Absolute Deviation (MAD)43
Skewness0.3725473414
Sum1623599936
Variance2870.849771
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
Value               Count   Frequency (%)
98103               477     2.9%
98115               476     2.9%
98052               467     2.8%
98034               446     2.7%
98117               442     2.7%
98038               431     2.6%
98042               431     2.6%
98118               406     2.5%
98133               402     2.4%
98023               399     2.4%
Other values (60)   12177   73.6%
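
The "Other values" row is whatever remains after the ten most frequent zip codes are counted, and each frequency is a count divided by the 16554 observations. A minimal sketch that reproduces both from the counts listed above (the dictionary name is illustrative):

```python
n_observations = 16554

# Ten most frequent zip codes and their counts, as listed in the report.
top10_counts = {
    98103: 477, 98115: 476, 98052: 467, 98034: 446, 98117: 442,
    98038: 431, 98042: 431, 98118: 406, 98133: 402, 98023: 399,
}

top10_total = sum(top10_counts.values())
other_count = n_observations - top10_total
other_pct = round(100 * other_count / n_observations, 1)
top_pct = round(100 * top10_counts[98103] / n_observations, 1)

print(other_count)  # 12177
print(other_pct)    # 73.6
print(top_pct)      # 2.9
```
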
ValueCountFrequency (%)
98001291
1.8%
98002160
1.0%
98003213
1.3%
98004205
1.2%
98005132
 
0.8%
98006377
2.3%
98007113
 
0.7%
98008226
1.4%
9801069
 
0.4%
98011150
 
0.9%
ValueCountFrequency (%)
98199234
1.4%
98198220
1.3%
98188108
 
0.7%
98178211
1.3%
98177192
1.2%
98168218
1.3%
98166200
1.2%
98155372
2.2%
9814851
 
0.3%
98146228
1.4%

grade
Real number (ℝ≥0)

HIGH CORRELATION

Distinct12
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7.583967621
Minimum1
Maximum13
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size64.8 KiB

Quantile statistics

Minimum1
5-th percentile6
Q17
median7
Q38
95-th percentile10
Maximum13
Range12
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.103965081
Coefficient of variation (CV)0.1455656374
Kurtosis1.371933156
Mean7.583967621
Median Absolute Deviation (MAD)1
Skewness0.7165229669
Sum125545
Variance1.2187389
MonotonicityNot monotonic
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
77145
43.2%
84770
28.8%
91881
 
11.4%
61592
 
9.6%
10698
 
4.2%
11211
 
1.3%
5183
 
1.1%
1240
 
0.2%
423
 
0.1%
137
 
< 0.1%
Other values (2)4
 
< 0.1%
ValueCountFrequency (%)
11
 
< 0.1%
33
 
< 0.1%
423
 
0.1%
5183
 
1.1%
61592
 
9.6%
77145
43.2%
84770
28.8%
91881
 
11.4%
10698
 
4.2%
11211
 
1.3%
ValueCountFrequency (%)
137
 
< 0.1%
1240
 
0.2%
11211
 
1.3%
10698
 
4.2%
91881
 
11.4%
84770
28.8%
77145
43.2%
61592
 
9.6%
5183
 
1.1%
423
 
0.1%

sqft_basement
Real number (ℝ≥0)

HIGH CORRELATION
MISSING

Distinct265
Distinct (%)4.1%
Missing10142
Missing (%)61.3%
Infinite0
Infinite (%)0.0%
Mean723.6218029
Minimum10
Maximum3500
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size129.5 KiB

Quantile statistics

Minimum10
5-th percentile180
Q1440
median700
Q3950
95-th percentile1400
Maximum3500
Range3490
Interquartile range (IQR)510

Descriptive statistics

Standard deviation386.7235438
Coefficient of variation (CV)0.5344277111
Kurtosis2.023896471
Mean723.6218029
Median Absolute Deviation (MAD)260
Skewness0.9228317222
Sum4639863
Variance149555.0993
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
600179
 
1.1%
500173
 
1.0%
700166
 
1.0%
800157
 
0.9%
400138
 
0.8%
900110
 
0.7%
1000108
 
0.7%
300106
 
0.6%
48089
 
0.5%
53086
 
0.5%
Other values (255)5100
30.8%
(Missing)10142
61.3%
ValueCountFrequency (%)
102
 
< 0.1%
201
 
< 0.1%
404
 
< 0.1%
507
 
< 0.1%
6010
 
0.1%
651
 
< 0.1%
705
 
< 0.1%
8014
0.1%
9017
0.1%
10030
0.2%
ValueCountFrequency (%)
35001
< 0.1%
34801
< 0.1%
32601
< 0.1%
30001
< 0.1%
27301
< 0.1%
26001
< 0.1%
25501
< 0.1%
25001
< 0.1%
24001
< 0.1%
23901
< 0.1%

view
Categorical

HIGH CORRELATION

Distinct5
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size937.8 KiB
0
15123 
2
 
666
3
 
324
1
 
251
4
 
190

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters16554
Distinct characters5
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row1
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
015123
91.4%
2666
 
4.0%
3324
 
2.0%
1251
 
1.5%
4190
 
1.1%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
015123
91.4%
2666
 
4.0%
3324
 
2.0%
1251
 
1.5%
4190
 
1.1%

Most occurring characters

ValueCountFrequency (%)
015123
91.4%
2666
 
4.0%
3324
 
2.0%
1251
 
1.5%
4190
 
1.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number16554
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
015123
91.4%
2666
 
4.0%
3324
 
2.0%
1251
 
1.5%
4190
 
1.1%

Most occurring scripts

ValueCountFrequency (%)
Common16554
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
015123
91.4%
2666
 
4.0%
3324
 
2.0%
1251
 
1.5%
4190
 
1.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII16554
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
015123
91.4%
2666
 
4.0%
3324
 
2.0%
1251
 
1.5%
4190
 
1.1%

bathrooms
Categorical

HIGH CORRELATION

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size970.1 KiB
2.0
8224 
1.0
6635 
3.0
1504 
4.0
 
191

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters49662
Distinct characters6
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2.0
2nd row1.0
3rd row2.0
4th row1.0
5th row2.0

Common Values

ValueCountFrequency (%)
2.08224
49.7%
1.06635
40.1%
3.01504
 
9.1%
4.0191
 
1.2%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
2.08224
49.7%
1.06635
40.1%
3.01504
 
9.1%
4.0191
 
1.2%

Most occurring characters

Value   Count   Frequency (%)
.       16554   33.3%
0       16554   33.3%
2       8224    16.6%
1       6635    13.4%
3       1504    3.0%
4       191     0.4%
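
Because bathrooms is profiled as a categorical (text) variable, the character table is obtained by counting every character of every value: each of the 16554 values such as "2.0" contributes one leading digit, one "." and one "0". A minimal sketch that rebuilds the character counts from the value counts reported for this variable (names are illustrative):

```python
from collections import Counter

# Value counts reported for bathrooms.
value_counts = {"2.0": 8224, "1.0": 6635, "3.0": 1504, "4.0": 191}

# Each occurrence of a value contributes each of its characters once.
char_counts = Counter()
for value, count in value_counts.items():
    for ch in value:
        char_counts[ch] += count

total_chars = sum(char_counts.values())

print(char_counts["."], char_counts["0"], char_counts["2"])  # 16554 16554 8224
print(total_chars)                                           # 49662
print(round(100 * char_counts["2"] / total_chars, 1))        # 16.6
```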

Most occurring categories

ValueCountFrequency (%)
Decimal Number33108
66.7%
Other Punctuation16554
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
016554
50.0%
28224
24.8%
16635
20.0%
31504
 
4.5%
4191
 
0.6%
Other Punctuation
ValueCountFrequency (%)
.16554
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common49662
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
.16554
33.3%
016554
33.3%
28224
16.6%
16635
13.4%
31504
 
3.0%
4191
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII49662
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
.16554
33.3%
016554
33.3%
28224
16.6%
16635
13.4%
31504
 
3.0%
4191
 
0.4%

bedrooms
Categorical

HIGH CORRELATION

Distinct5
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size970.1 KiB
3.0
7846 
4.0
5208 
2.0
2163 
5.0
1176 
1.0
 
161

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters49662
Distinct characters7
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3.0
2nd row3.0
3rd row4.0
4th row5.0
5th row3.0

Common Values

ValueCountFrequency (%)
3.07846
47.4%
4.05208
31.5%
2.02163
 
13.1%
5.01176
 
7.1%
1.0161
 
1.0%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
3.07846
47.4%
4.05208
31.5%
2.02163
 
13.1%
5.01176
 
7.1%
1.0161
 
1.0%

Most occurring characters

ValueCountFrequency (%)
.16554
33.3%
016554
33.3%
37846
15.8%
45208
 
10.5%
22163
 
4.4%
51176
 
2.4%
1161
 
0.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number33108
66.7%
Other Punctuation16554
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
016554
50.0%
37846
23.7%
45208
 
15.7%
22163
 
6.5%
51176
 
3.6%
1161
 
0.5%
Other Punctuation
ValueCountFrequency (%)
.16554
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common49662
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
.16554
33.3%
016554
33.3%
37846
15.8%
45208
 
10.5%
22163
 
4.4%
51176
 
2.4%
1161
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII49662
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
.16554
33.3%
016554
33.3%
37846
15.8%
45208
 
10.5%
22163
 
4.4%
51176
 
2.4%
1161
 
0.3%

sqft_above
Real number (ℝ≥0)

HIGH CORRELATION

Distinct790
Distinct (%)4.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1736.308626
Minimum290
Maximum8570
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size129.5 KiB

Quantile statistics

Minimum290
5-th percentile840
Q11180
median1530
Q32140
95-th percentile3230
Maximum8570
Range8280
Interquartile range (IQR)960

Descriptive statistics

Standard deviation770.3646816
Coefficient of variation (CV)0.4436795797
Kurtosis2.58879067
Mean1736.308626
Median Absolute Deviation (MAD)430
Skewness1.316284753
Sum28742853
Variance593461.7427
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1200167
 
1.0%
1300157
 
0.9%
1010155
 
0.9%
1400153
 
0.9%
1220146
 
0.9%
1340146
 
0.9%
1180145
 
0.9%
1060145
 
0.9%
1140140
 
0.8%
1100134
 
0.8%
Other values (780)15066
91.0%
ValueCountFrequency (%)
2901
 
< 0.1%
3801
 
< 0.1%
3901
 
< 0.1%
4202
< 0.1%
4301
 
< 0.1%
4401
 
< 0.1%
4702
< 0.1%
4804
< 0.1%
4902
< 0.1%
5002
< 0.1%
ValueCountFrequency (%)
85701
< 0.1%
80201
< 0.1%
76801
< 0.1%
64201
< 0.1%
63801
< 0.1%
63501
< 0.1%
62001
< 0.1%
61101
< 0.1%
60901
< 0.1%
60701
< 0.1%

sqft_living15
Real number (ℝ≥0)

HIGH CORRELATION

Distinct679
Distinct (%)4.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1944.15084
Minimum460
Maximum5790
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size129.5 KiB

Quantile statistics

Minimum460
5-th percentile1130
Q11470
median1810
Q32300
95-th percentile3190
Maximum5790
Range5330
Interquartile range (IQR)830

Descriptive statistics

Standard deviation649.2374266
Coefficient of variation (CV)0.3339439581
Kurtosis1.455088632
Mean1944.15084
Median Absolute Deviation (MAD)390
Skewness1.059620425
Sum32183473
Variance421509.2361
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1440160
 
1.0%
1560159
 
1.0%
1540155
 
0.9%
1500150
 
0.9%
1460144
 
0.9%
1720138
 
0.8%
1580137
 
0.8%
1480134
 
0.8%
1610134
 
0.8%
1520134
 
0.8%
Other values (669)15109
91.3%
ValueCountFrequency (%)
4601
 
< 0.1%
6202
 
< 0.1%
6701
 
< 0.1%
6902
 
< 0.1%
7002
 
< 0.1%
7101
 
< 0.1%
7202
 
< 0.1%
7405
< 0.1%
7501
 
< 0.1%
7602
 
< 0.1%
ValueCountFrequency (%)
57905
< 0.1%
56001
 
< 0.1%
53801
 
< 0.1%
53301
 
< 0.1%
52201
 
< 0.1%
50801
 
< 0.1%
50701
 
< 0.1%
49501
 
< 0.1%
49301
 
< 0.1%
49131
 
< 0.1%

lat
Real number (ℝ≥0)

Distinct429
Distinct (%)2.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4478.747493
Minimum47
Maximum47777
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size64.8 KiB

Quantile statistics

Minimum47
5-th percentile47
Q147
median47
Q347
95-th percentile47559
Maximum47777
Range47730
Interquartile range (IQR)0

Descriptive statistics

Standard deviation13818.36921
Coefficient of variation (CV)3.085319999
Kurtosis5.826857873
Mean4478.747493
Median Absolute Deviation (MAD)0
Skewness2.797507278
Sum74141186
Variance190947327.7
MonotonicityNot monotonic
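
The lat statistics mix values of 47 with values such as 47559 and 47777, which is hard to reconcile with a latitude in degrees. One possible explanation, offered only as an assumption, is that the decimal separator was lost on import for some rows (for example 47.559 read as 47559); the long variable shows the same pattern with -122512 alongside -122. A minimal sketch of a repair under that assumption follows. The helper `rescale_coordinate` and its `scale` default are hypothetical and would need checking against the raw data:

```python
def rescale_coordinate(value, limit, scale=1000.0):
    """Divide by `scale` when |value| lies outside the valid range `limit`.

    Assumes out-of-range values lost a decimal separator three digits
    from the right; in-range values are returned unchanged.
    """
    if abs(value) > limit:
        return value / scale
    return value

print(rescale_coordinate(47559, limit=90))     # 47.559
print(rescale_coordinate(47, limit=90))        # 47
print(rescale_coordinate(-122512, limit=180))  # -122.512
```

Rows stored as a bare 47 or -122 carry no fractional digits, so this sketch cannot restore their precision; they would have to be re-read from the source file.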
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4715010
90.7%
4768613
 
0.1%
4769512
 
0.1%
4754312
 
0.1%
4756711
 
0.1%
4755411
 
0.1%
4767911
 
0.1%
4763710
 
0.1%
4768710
 
0.1%
4769710
 
0.1%
Other values (419)1444
 
8.7%
ValueCountFrequency (%)
4715010
90.7%
471841
 
< 0.1%
471941
 
< 0.1%
471962
 
< 0.1%
472011
 
< 0.1%
472021
 
< 0.1%
472051
 
< 0.1%
472081
 
< 0.1%
472092
 
< 0.1%
472582
 
< 0.1%
ValueCountFrequency (%)
477772
 
< 0.1%
477767
< 0.1%
477753
 
< 0.1%
477749
0.1%
477735
< 0.1%
477723
 
< 0.1%
477713
 
< 0.1%
477691
 
< 0.1%
477685
< 0.1%
477672
 
< 0.1%

waterfront
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size937.8 KiB
0
16459 
1
 
95

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters16554
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
016459
99.4%
195
 
0.6%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
016459
99.4%
195
 
0.6%

Most occurring characters

ValueCountFrequency (%)
016459
99.4%
195
 
0.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number16554
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
016459
99.4%
195
 
0.6%

Most occurring scripts

ValueCountFrequency (%)
Common16554
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
016459
99.4%
195
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII16554
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
016459
99.4%
195
 
0.6%

floors
Categorical

HIGH CORRELATION

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size970.1 KiB
1.0
9811 
2.0
6254 
3.0
 
489

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters49662
Distinct characters5
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2.0
2nd row1.0
3rd row2.0
4th row1.0
5th row1.0

Common Values

ValueCountFrequency (%)
1.09811
59.3%
2.06254
37.8%
3.0489
 
3.0%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
1.09811
59.3%
2.06254
37.8%
3.0489
 
3.0%

Most occurring characters

ValueCountFrequency (%)
.16554
33.3%
016554
33.3%
19811
19.8%
26254
 
12.6%
3489
 
1.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number33108
66.7%
Other Punctuation16554
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
016554
50.0%
19811
29.6%
26254
 
18.9%
3489
 
1.5%
Other Punctuation
ValueCountFrequency (%)
.16554
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common49662
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
.16554
33.3%
016554
33.3%
19811
19.8%
26254
 
12.6%
3489
 
1.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII49662
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
.16554
33.3%
016554
33.3%
19811
19.8%
26254
 
12.6%
3489
 
1.0%

yr_renovated
Real number (ℝ≥0)

HIGH CORRELATION
MISSING

Distinct69
Distinct (%)10.6%
Missing15905
Missing (%)96.1%
Infinite0
Infinite (%)0.0%
Mean1995.50077
Minimum1934
Maximum2015
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size129.5 KiB

Quantile statistics

Minimum1934
5-th percentile1962
Q11986
median2000
Q32008
95-th percentile2014
Maximum2015
Range81
Interquartile range (IQR)22

Descriptive statistics

Standard deviation16.39624362
Coefficient of variation (CV)0.00821660601
Kurtosis0.7796489903
Mean1995.50077
Median Absolute Deviation (MAD)11
Skewness-1.037317534
Sum1295080
Variance268.836805
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
201473
 
0.4%
201330
 
0.2%
200524
 
0.1%
200024
 
0.1%
200723
 
0.1%
199020
 
0.1%
200620
 
0.1%
200319
 
0.1%
200417
 
0.1%
200916
 
0.1%
Other values (59)383
 
2.3%
(Missing)15905
96.1%
ValueCountFrequency (%)
19341
 
< 0.1%
19402
< 0.1%
19441
 
< 0.1%
19452
< 0.1%
19462
< 0.1%
19481
 
< 0.1%
19502
< 0.1%
19511
 
< 0.1%
19533
< 0.1%
19542
< 0.1%
ValueCountFrequency (%)
201511
 
0.1%
201473
0.4%
201330
0.2%
20129
 
0.1%
20118
 
< 0.1%
201011
 
0.1%
200916
 
0.1%
200815
 
0.1%
200723
 
0.1%
200620
 
0.1%

yr_built
Real number (ℝ≥0)

HIGH CORRELATION

Distinct116
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1970.676272
Minimum1900
Maximum2015
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size129.5 KiB

Quantile statistics

Minimum1900
5-th percentile1915
Q11951
median1974
Q31996
95-th percentile2011
Maximum2015
Range115
Interquartile range (IQR)45

Descriptive statistics

Standard deviation29.34426029
Coefficient of variation (CV)0.01489045193
Kurtosis-0.6692721324
Mean1970.676272
Median Absolute Deviation (MAD)23
Skewness-0.4487016301
Sum32622575
Variance861.0856122
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2014423
 
2.6%
2005355
 
2.1%
2006334
 
2.0%
2004332
 
2.0%
2003326
 
2.0%
1977320
 
1.9%
2007316
 
1.9%
1978310
 
1.9%
1968294
 
1.8%
2008277
 
1.7%
Other values (106)13267
80.1%
ValueCountFrequency (%)
190061
0.4%
190124
 
0.1%
190223
 
0.1%
190336
0.2%
190436
0.2%
190552
0.3%
190668
0.4%
190759
0.4%
190866
0.4%
190964
0.4%
ValueCountFrequency (%)
201530
 
0.2%
2014423
2.6%
2013152
 
0.9%
2012126
 
0.8%
2011102
 
0.6%
2010113
 
0.7%
2009178
1.1%
2008277
1.7%
2007316
1.9%
2006334
2.0%

long
Real number (ℝ)

Distinct608
Distinct (%)3.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-109686.7868
Minimum-122512
Maximum-121
Zeros0
Zeros (%)0.0%
Negative16554
Negative (%)100.0%
Memory size64.8 KiB

Quantile statistics

Minimum-122512
5-th percentile-122385
Q1-122321
median-122207
Q3-122073
95-th percentile-122
Maximum-121
Range122391
Interquartile range (IQR)248

Descriptive statistics

Standard deviation37055.04635
Coefficient of variation (CV)-0.3378259809
Kurtosis4.859395346
Mean-109686.7868
Median Absolute Deviation (MAD)118
Skewness2.618907141
Sum-1815755069
Variance1373076460
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
-1221611
 
9.7%
-12188
 
0.5%
-12236281
 
0.5%
-12236379
 
0.5%
-12229177
 
0.5%
-12230477
 
0.5%
-12235176
 
0.5%
-12235776
 
0.5%
-12235275
 
0.5%
-12236575
 
0.5%
Other values (598)14239
86.0%
ValueCountFrequency (%)
-1225121
< 0.1%
-1225031
< 0.1%
-1225021
< 0.1%
-1224971
< 0.1%
-1224791
< 0.1%
-1224751
< 0.1%
-1224741
< 0.1%
-1224721
< 0.1%
-1224671
< 0.1%
-1224652
< 0.1%
ValueCountFrequency (%)
-12188
 
0.5%
-1221611
9.7%
-1213152
 
< 0.1%
-1213161
 
< 0.1%
-1213191
 
< 0.1%
-1213211
 
< 0.1%
-1213251
 
< 0.1%
-1213522
 
< 0.1%
-1213591
 
< 0.1%
-1213642
 
< 0.1%

jhygtf
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct70
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean78.23365954
Minimum0
Maximum2015
Zeros15905
Zeros (%)96.1%
Negative0
Negative (%)0.0%
Memory size129.5 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum2015
Range2015
Interquartile range (IQR)0

Descriptive statistics

Standard deviation387.3169349
Coefficient of variation (CV)4.950771026
Kurtosis20.56126148
Mean78.23365954
Median Absolute Deviation (MAD)0
Skewness4.749415374
Sum1295080
Variance150014.408
MonotonicityNot monotonic
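
jhygtf has the same sum as yr_renovated (1295080), and its 15905 zeros match the 15905 missing values of yr_renovated. That is consistent with jhygtf being yr_renovated with missing values replaced by 0, although the report itself does not say so; treat it as an assumption. Under that reading, the two very different means come from dividing the same sum by different row counts, as this sketch shows:

```python
# Figures copied from the yr_renovated and jhygtf statistics.
total_rows = 16554
rows_without_year = 15905   # missing in yr_renovated, zero in jhygtf
year_sum = 1295080          # identical Sum for both variables

rows_with_year = total_rows - rows_without_year

# Missing values are excluded from the mean; zeros are not.
mean_excluding_missing = year_sum / rows_with_year
mean_with_zeros = year_sum / total_rows

print(rows_with_year)                    # 649
print(round(mean_excluding_missing, 5))  # 1995.50077
print(round(mean_with_zeros, 5))         # 78.23366
```

The zero-filled mean of about 78 is not a meaningful year, which is one reason to prefer the version with explicit missing values for analysis.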
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
015905
96.1%
201473
 
0.4%
201330
 
0.2%
200524
 
0.1%
200024
 
0.1%
200723
 
0.1%
199020
 
0.1%
200620
 
0.1%
200319
 
0.1%
200417
 
0.1%
Other values (60)399
 
2.4%
ValueCountFrequency (%)
015905
96.1%
19341
 
< 0.1%
19402
 
< 0.1%
19441
 
< 0.1%
19452
 
< 0.1%
19462
 
< 0.1%
19481
 
< 0.1%
19502
 
< 0.1%
19511
 
< 0.1%
19533
 
< 0.1%
ValueCountFrequency (%)
201511
 
0.1%
201473
0.4%
201330
0.2%
20129
 
0.1%
20118
 
< 0.1%
201011
 
0.1%
200916
 
0.1%
200815
 
0.1%
200723
 
0.1%
200620
 
0.1%

sqft_lot
Real number (ℝ≥0)

HIGH CORRELATION

Distinct7871
Distinct (%)47.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean9934.879667
Minimum520
Maximum137214
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size129.5 KiB

Quantile statistics

Minimum520
5-th percentile1715.65
Q15000
median7480
Q310140
95-th percentile29944
Maximum137214
Range136694
Interquartile range (IQR)5140

Descriptive statistics

Standard deviation10957.36316
Coefficient of variation (CV)1.102918559
Kurtosis31.24093454
Mean9934.879667
Median Absolute Deviation (MAD)2507.5
Skewness4.698176897
Sum164461998
Variance120063807.5
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
5000289
 
1.7%
6000221
 
1.3%
4000197
 
1.2%
7200174
 
1.1%
480097
 
0.6%
750095
 
0.6%
450093
 
0.6%
960091
 
0.5%
840088
 
0.5%
360079
 
0.5%
Other values (7861)15130
91.4%
ValueCountFrequency (%)
5201
< 0.1%
6001
< 0.1%
6091
< 0.1%
6351
< 0.1%
6381
< 0.1%
6492
< 0.1%
6511
< 0.1%
6761
< 0.1%
6811
< 0.1%
6831
< 0.1%
ValueCountFrequency (%)
1372141
< 0.1%
1369151
< 0.1%
1367781
< 0.1%
1362901
< 0.1%
1306801
< 0.1%
1300171
< 0.1%
1276311
< 0.1%
1254521
< 0.1%
1220381
< 0.1%
1206611
< 0.1%

price
Real number (ℝ≥0)

HIGH CORRELATION

Distinct3237
Distinct (%)19.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean510603.4272
Minimum75000
Maximum7700000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size129.5 KiB

Quantile statistics

Minimum75000
5-th percentile210000
Q1316000
median440000
Q3620000
95-th percentile963000
Maximum7700000
Range7625000
Interquartile range (IQR)304000

Descriptive statistics

Standard deviation323805.8088
Coefficient of variation (CV)0.634163015
Kurtosis50.59509917
Mean510603.4272
Median Absolute Deviation (MAD)141000
Skewness4.658301218
Sum8452529134
Variance: 1.048502018 × 10^11
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
350000137
 
0.8%
450000133
 
0.8%
425000128
 
0.8%
550000126
 
0.8%
500000123
 
0.7%
375000116
 
0.7%
325000114
 
0.7%
300000109
 
0.7%
400000108
 
0.7%
250000105
 
0.6%
Other values (3227)15355
92.8%
ValueCountFrequency (%)
750001
< 0.1%
780001
< 0.1%
800001
< 0.1%
810001
< 0.1%
820001
< 0.1%
825001
< 0.1%
830001
< 0.1%
840001
< 0.1%
850002
< 0.1%
890001
< 0.1%
ValueCountFrequency (%)
77000001
< 0.1%
70625001
< 0.1%
55700001
< 0.1%
53000001
< 0.1%
51108001
< 0.1%
45000001
< 0.1%
40000001
< 0.1%
38500001
< 0.1%
38000002
< 0.1%
37100001
< 0.1%

condition
Categorical

HIGH CORRELATION

Distinct5
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size937.8 KiB
3
10763 
4
4357 
5
1285 
2
 
126
1
 
23

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters16554
Distinct characters5
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3
2nd row3
3rd row3
4th row3
5th row3

Common Values

ValueCountFrequency (%)
310763
65.0%
44357
26.3%
51285
 
7.8%
2126
 
0.8%
123
 
0.1%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
310763
65.0%
44357
26.3%
51285
 
7.8%
2126
 
0.8%
123
 
0.1%

Most occurring characters

ValueCountFrequency (%)
310763
65.0%
44357
26.3%
51285
 
7.8%
2126
 
0.8%
123
 
0.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number16554
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
310763
65.0%
44357
26.3%
51285
 
7.8%
2126
 
0.8%
123
 
0.1%

Most occurring scripts

ValueCountFrequency (%)
Common16554
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
310763
65.0%
44357
26.3%
51285
 
7.8%
2126
 
0.8%
123
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII16554
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
310763
65.0%
44357
26.3%
51285
 
7.8%
2126
 
0.8%
123
 
0.1%

sqft_lot15
Real number (ℝ≥0)

HIGH CORRELATION

Distinct7013
Distinct (%)42.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean8996.247856
Minimum659
Maximum57140
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size129.5 KiB

Quantile statistics

Minimum659
5-th percentile1921.95
Q15011.25
median7500
Q39750
95-th percentile23045.85
Maximum57140
Range56481
Interquartile range (IQR)4738.75

Descriptive statistics

Standard deviation7636.814694
Coefficient of variation (CV)0.848888872
Kurtosis11.84745816
Mean8996.247856
Median Absolute Deviation (MAD)2400
Skewness3.168057171
Sum148923887
Variance58320938.67
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
5000339
 
2.0%
4000285
 
1.7%
6000230
 
1.4%
7200171
 
1.0%
7500111
 
0.7%
4800108
 
0.7%
450094
 
0.6%
840090
 
0.5%
360089
 
0.5%
408082
 
0.5%
Other values (7003)14955
90.3%
ValueCountFrequency (%)
6591
 
< 0.1%
6601
 
< 0.1%
7481
 
< 0.1%
7503
< 0.1%
7551
 
< 0.1%
7581
 
< 0.1%
7941
 
< 0.1%
8102
< 0.1%
8863
< 0.1%
8871
 
< 0.1%
ValueCountFrequency (%)
571401
 
< 0.1%
570632
 
< 0.1%
570001
 
< 0.1%
568271
 
< 0.1%
566286
< 0.1%
565681
 
< 0.1%
561922
 
< 0.1%
556571
 
< 0.1%
553221
 
< 0.1%
550231
 
< 0.1%

sqft_living
Real number (ℝ≥0)

HIGH CORRELATION

Distinct871
Distinct (%)5.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2016.595143
Minimum290
Maximum12050
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size129.5 KiB

Quantile statistics

Minimum                     290
5-th percentile             920
Q1                          1400
median                      1880
Q3                          2478.75
95-th percentile            3560
Maximum                     12050
Range                       11760
Interquartile range (IQR)   1078.75

Descriptive statistics

Standard deviation                848.7053985
Coefficient of variation (CV)     0.4208605785
Kurtosis                          4.451354919
Mean                              2016.595143
Median Absolute Deviation (MAD)   520
Skewness                          1.323768322
Sum                               33382716
Variance                          720300.8534
Monotonicity                      Not monotonic

[Figure omitted: histogram with fixed size bins (bins=50)]
Common values

Value               Count  Frequency (%)
1440                111    0.7%
1400                110    0.7%
1300                106    0.6%
1540                102    0.6%
1480                102    0.6%
1010                99     0.6%
1660                99     0.6%
1200                99     0.6%
1900                98     0.6%
1560                98     0.6%
Other values (861)  15530  93.8%

Minimum 10 values

Value  Count  Frequency (%)
290    1      < 0.1%
380    1      < 0.1%
390    1      < 0.1%
420    2      < 0.1%
430    1      < 0.1%
440    1      < 0.1%
470    2      < 0.1%
480    2      < 0.1%
490    1      < 0.1%
500    1      < 0.1%

Maximum 10 values

Value  Count  Frequency (%)
12050  1      < 0.1%
10040  1      < 0.1%
9200   1      < 0.1%
8020   1      < 0.1%
8010   1      < 0.1%
7710   1      < 0.1%
7620   1      < 0.1%
7480   1      < 0.1%
7390   1      < 0.1%
7350   1      < 0.1%

tiene_sotano
Categorical

Distinct        2
Distinct (%)    < 0.1%
Missing         0
Missing (%)     0.0%
Memory size     937.8 KiB

Value  Count
0      10142
1      6412

Length

Max length      1
Median length   1
Mean length     1
Min length      1

Characters and Unicode

Total characters      16554
Distinct characters   2
Distinct categories   1
Distinct scripts      1
Distinct blocks       1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique       0
Unique (%)   0.0%

Sample

1st row   0
2nd row   1
3rd row   0
4th row   1
5th row   1

Common Values

Value  Count  Frequency (%)
0      10142  61.3%
1      6412   38.7%

Length

[Figure omitted: histogram of lengths of the category]

Category Frequency Plot

Value  Count  Frequency (%)
0      10142  61.3%
1      6412   38.7%

Most occurring characters

Value  Count  Frequency (%)
0      10142  61.3%
1      6412   38.7%

Most occurring categories

Value           Count  Frequency (%)
Decimal Number  16554  100.0%

Most frequent character per category

Decimal Number
Value  Count  Frequency (%)
0      10142  61.3%
1      6412   38.7%

Most occurring scripts

Value   Count  Frequency (%)
Common  16554  100.0%

Most frequent character per script

Common
Value  Count  Frequency (%)
0      10142  61.3%
1      6412   38.7%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  16554  100.0%

Most frequent character per block

ASCII
Value  Count  Frequency (%)
0      10142  61.3%
1      6412   38.7%

fue_renovada
Categorical

HIGH CORRELATION

Distinct        2
Distinct (%)    < 0.1%
Missing         0
Missing (%)     0.0%
Memory size     937.8 KiB

Value  Count
0      15905
1      649

Length

Max length      1
Median length   1
Mean length     1
Min length      1

Characters and Unicode

Total characters      16554
Distinct characters   2
Distinct categories   1
Distinct scripts      1
Distinct blocks       1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique       0
Unique (%)   0.0%

Sample

1st row   0
2nd row   0
3rd row   0
4th row   0
5th row   0

Common Values

Value  Count  Frequency (%)
0      15905  96.1%
1      649    3.9%

Length

[Figure omitted: histogram of lengths of the category]

Category Frequency Plot

Value  Count  Frequency (%)
0      15905  96.1%
1      649    3.9%

Most occurring characters

Value  Count  Frequency (%)
0      15905  96.1%
1      649    3.9%

Most occurring categories

Value           Count  Frequency (%)
Decimal Number  16554  100.0%

Most frequent character per category

Decimal Number
Value  Count  Frequency (%)
0      15905  96.1%
1      649    3.9%

Most occurring scripts

Value   Count  Frequency (%)
Common  16554  100.0%

Most frequent character per script

Common
Value  Count  Frequency (%)
0      15905  96.1%
1      649    3.9%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  16554  100.0%

Most frequent character per block

ASCII
Value  Count  Frequency (%)
0      15905  96.1%
1      649    3.9%

yr_date
Categorical

Distinct        2
Distinct (%)    < 0.1%
Missing         0
Missing (%)     0.0%
Memory size     1018.6 KiB

Value   Count
2014.0  11227
2015.0  5327

Length

Max length      6
Median length   6
Mean length     6
Min length      6

Characters and Unicode

Total characters      99324
Distinct characters   6
Distinct categories   2
Distinct scripts      1
Distinct blocks       1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique       0
Unique (%)   0.0%

Sample

1st row   2015.0
2nd row   2015.0
3rd row   2014.0
4th row   2015.0
5th row   2015.0

Common Values

Value   Count  Frequency (%)
2014.0  11227  67.8%
2015.0  5327   32.2%

Length

[Figure omitted: histogram of lengths of the category]

Category Frequency Plot

Value   Count  Frequency (%)
2014.0  11227  67.8%
2015.0  5327   32.2%

Most occurring characters

Value  Count  Frequency (%)
0      33108  33.3%
2      16554  16.7%
1      16554  16.7%
.      16554  16.7%
4      11227  11.3%
5      5327   5.4%

Most occurring categories

Value              Count  Frequency (%)
Decimal Number     82770  83.3%
Other Punctuation  16554  16.7%

Most frequent character per category

Decimal Number
Value  Count  Frequency (%)
0      33108  40.0%
2      16554  20.0%
1      16554  20.0%
4      11227  13.6%
5      5327   6.4%

Other Punctuation
Value  Count  Frequency (%)
.      16554  100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  99324  100.0%

Most frequent character per script

Common
Value  Count  Frequency (%)
0      33108  33.3%
2      16554  16.7%
1      16554  16.7%
.      16554  16.7%
4      11227  11.3%
5      5327   5.4%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  99324  100.0%

Most frequent character per block

ASCII
Value  Count  Frequency (%)
0      33108  33.3%
2      16554  16.7%
1      16554  16.7%
.      16554  16.7%
4      11227  11.3%
5      5327   5.4%

antiguedad_venta
Real number (ℝ)

HIGH CORRELATION
ZEROS

Distinct        117
Distinct (%)    0.7%
Missing         0
Missing (%)     0.0%
Infinite        0
Infinite (%)    0.0%
Mean            43.64552374
Minimum         -1
Maximum         115
Zeros           320
Zeros (%)       1.9%
Negative        12
Negative (%)    0.1%
Memory size     129.5 KiB

Quantile statistics

Minimum                     -1
5-th percentile             4
Q1                          18
median                      40
Q3                          63
95-th percentile            99
Maximum                     115
Range                       116
Interquartile range (IQR)   45

Descriptive statistics

Standard deviation                29.34580409
Coefficient of variation (CV)     0.6723668678
Kurtosis                          -0.6696125664
Mean                              43.64552374
Median Absolute Deviation (MAD)   23
Skewness                          0.4480073194
Sum                               722508
Variance                          861.1762178
Monotonicity                      Not monotonic

[Figure omitted: histogram with fixed size bins (bins=50)]
Common values

Value               Count  Frequency (%)
9                   355    2.1%
11                  344    2.1%
8                   338    2.0%
10                  330    2.0%
0                   320    1.9%
37                  314    1.9%
7                   300    1.8%
36                  293    1.8%
46                  277    1.7%
47                  274    1.7%
Other values (107)  13409  81.0%

Minimum 10 values

Value  Count  Frequency (%)
-1     12     0.1%
0      320    1.9%
1      220    1.3%
2      133    0.8%
3      120    0.7%
4      102    0.6%
5      150    0.9%
6      250    1.5%
7      300    1.8%
8      338    2.0%

Maximum 10 values

Value  Count  Frequency (%)
115    21     0.1%
114    47     0.3%
113    23     0.1%
112    27     0.2%
111    39     0.2%
110    42     0.3%
109    51     0.3%
108    64     0.4%
107    71     0.4%
106    52     0.3%
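
The figures in this report suggest that three columns are derived from others. In the sample rows at the end of the report, antiguedad_venta ("age at sale") equals yr_date minus yr_built. The count of tiene_sotano ("has basement") equal to 0 matches the number of missing sqft_basement values (10142), and the count of fue_renovada ("was renovated") equal to 0 matches the number of missing yr_renovated values (15905). The sketch below expresses that inferred logic in plain Python. It is an assumption drawn from the report, not the author's preprocessing code, and the helper name derive_features is hypothetical.

```python
import math

def derive_features(row):
    """Return the three derived fields for one record (a dict).

    Assumption: these derivations are inferred from the report's figures,
    not taken from the original preprocessing code.
    """
    def present(value):
        # Treat None and float NaN as missing.
        return value is not None and not (isinstance(value, float) and math.isnan(value))

    return {
        "tiene_sotano": int(present(row["sqft_basement"])),
        "fue_renovada": int(present(row["yr_renovated"])),
        # Negative when the sale year precedes the construction year.
        "antiguedad_venta": row["yr_date"] - row["yr_built"],
    }

# Two records using values from the report's first sample rows.
rows = [
    {"sqft_basement": float("nan"), "yr_renovated": float("nan"),
     "yr_built": 1993.0, "yr_date": 2015.0},
    {"sqft_basement": 650.0, "yr_renovated": float("nan"),
     "yr_built": 1974.0, "yr_date": 2015.0},
]
derived = [derive_features(r) for r in rows]
print(derived)
```

Under this reading, the 12 negative values (minimum -1) would correspond to houses sold in the year before their recorded construction year.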

Interactions

[Figures omitted: 256 pairwise interaction plots, one for each ordered pair of the 16 numeric variables.]

Correlations


Spearman's ρ

Spearman's rank correlation coefficient (ρ) measures monotonic correlation between two variables and is therefore better than Pearson's r at catching nonlinear monotonic relationships. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
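
The calculation described above can be sketched in plain Python: rank each variable (tied values share the mean of their positions), then apply the covariance-over-standard-deviations formula to the ranks. The function names and the toy data are illustrative and are not taken from the dataset.

```python
import math

def average_ranks(values):
    """Rank values from 1..n; tied values receive the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks

def pearson(x, y):
    """Covariance divided by the product of standard deviations."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def spearman(x, y):
    """Spearman's rho: Pearson's r computed on the rank variables."""
    return pearson(average_ranks(x), average_ranks(y))

x = [1, 2, 3, 4, 5]
y = [1, 8, 27, 64, 125]  # monotonic in x, but not linear
rho = spearman(x, y)
r = pearson(x, y)
print(rho, r)
```

On this toy input the relationship is perfectly monotonic, so ρ is 1 up to floating-point error while Pearson's r stays below 1.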

Pearson's r

Pearson's correlation coefficient (r) measures linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
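
A minimal Python sketch of this formula follows, together with a check of the invariance property: shifting and rescaling either variable by a positive factor leaves r unchanged. The function name and the toy data are illustrative, not taken from the dataset.

```python
import math

def pearson(x, y):
    """Pearson's r: covariance of x and y over the product of their standard deviations."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    # The 1/n (or 1/(n-1)) factors cancel between numerator and denominator.
    return cov / (sx * sy)

x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.1, 3.9, 6.2, 8.1, 9.8]
r = pearson(x, y)

# Change location and (positive) scale of both variables separately.
r_rescaled = pearson([10 * a + 3 for a in x], [0.5 * b - 7 for b in y])
print(r, r_rescaled)
```

A negative scale factor applied to one variable would flip the sign of r but keep its magnitude.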

Kendall's τ

Like Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
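
The pair-counting definition above can be sketched directly in Python. This is the plain form often called tau-a; library implementations commonly use the tie-adjusted tau-b variant, so results can differ when the data contain ties. The function name and the toy data are illustrative.

```python
from itertools import combinations

def kendall_tau_a(x, y):
    """(concordant - discordant) / total number of pairs."""
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        s = (x1 - x2) * (y1 - y2)
        if s > 0:
            concordant += 1   # both variables move in the same direction
        elif s < 0:
            discordant += 1   # the variables move in opposite directions
        # s == 0 means a tie in x or y; it counts toward neither group
    n_pairs = len(x) * (len(x) - 1) // 2
    return (concordant - discordant) / n_pairs

x = [1, 2, 3, 4, 5]
y = [3, 1, 4, 2, 5]
tau = kendall_tau_a(x, y)
print(tau)
```

For this toy input there are 7 concordant and 3 discordant pairs out of 10, so τ = (7 - 3) / 10 = 0.4.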

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use a bias-corrected measure proposed by Bergsma in 2013.
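
As an illustration, here is a sketch of the bias-corrected estimator as it is usually stated for an r × k contingency table with n observations: φ² = χ²/n is reduced by (k-1)(r-1)/(n-1) and floored at zero, and the table dimensions are shrunk to r - (r-1)²/(n-1) and k - (k-1)²/(n-1). The function name is hypothetical, no continuity correction is applied to χ², and the report's own implementation may differ in such details.

```python
import math

def cramers_v_corrected(table):
    """Bias-corrected Cramér's V for a contingency table given as a list of rows of counts.

    Assumes every row total and column total is nonzero.
    """
    n = sum(sum(row) for row in table)
    r, k = len(table), len(table[0])
    row_tot = [sum(row) for row in table]
    col_tot = [sum(table[i][j] for i in range(r)) for j in range(k)]

    # Pearson chi-squared statistic (no continuity correction).
    chi2 = 0.0
    for i in range(r):
        for j in range(k):
            expected = row_tot[i] * col_tot[j] / n
            chi2 += (table[i][j] - expected) ** 2 / expected

    phi2 = chi2 / n
    phi2_corr = max(0.0, phi2 - (k - 1) * (r - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    k_corr = k - (k - 1) ** 2 / (n - 1)
    return math.sqrt(phi2_corr / min(k_corr - 1, r_corr - 1))

v_perfect = cramers_v_corrected([[50, 0], [0, 50]])        # perfectly associated
v_independent = cramers_v_corrected([[25, 25], [25, 25]])  # independent
print(v_perfect, v_independent)
```

The two toy tables sit at the extremes of the 0 to 1 range described above.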

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. Extensive documentation is available from the phik project.

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
The dendrogram allows you to more fully correlate variable completion, revealing trends deeper than the pairwise ones visible in the correlation heatmap.
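
Nullity correlation can be read as the Pearson correlation between two columns' missing-value indicators (1 where the cell is missing, 0 otherwise). The following is a minimal sketch under that reading. The function name is hypothetical, None stands for a missing cell, and the toy columns are not taken from the dataset.

```python
import math

def nullity_correlation(col_a, col_b):
    """Pearson correlation between the missing-value indicators of two columns.

    Undefined (division by zero) when a column has no missing cells or only
    missing cells, because its indicator then has zero variance.
    """
    a = [1.0 if v is None else 0.0 for v in col_a]
    b = [1.0 if v is None else 0.0 for v in col_b]
    n = len(a)
    ma, mb = sum(a) / n, sum(b) / n
    cov = sum((p - ma) * (q - mb) for p, q in zip(a, b))
    sa = math.sqrt(sum((p - ma) ** 2 for p in a))
    sb = math.sqrt(sum((q - mb) ** 2 for q in b))
    return cov / (sa * sb)

# Toy columns; None marks a missing cell.
basement = [None, 650.0, None, 900.0, 320.0, None]
renovated = [None, None, None, 1995.0, None, None]
other = [1, None, 2, None, None, 3]

same = nullity_correlation(basement, basement)      # identical missingness pattern
opposite = nullity_correlation(basement, other)     # one present exactly when the other is missing
partial = nullity_correlation(basement, renovated)  # partly overlapping missingness
print(same, opposite, partial)
```

A value near +1 means the two columns tend to be missing together, and a value near -1 means one tends to be present exactly when the other is missing.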

Sample

First rows

       df_index  zipcode  grade  sqft_basement  view  bathrooms  bedrooms  sqft_above  sqft_living15  lat  waterfront  floors  yr_renovated  yr_built  long     jhygtf  sqft_lot  price     condition  sqft_lot15  sqft_living  tiene_sotano  fue_renovada  yr_date  antiguedad_venta
0      19857     98006    10     NaN            0     2.0        3.0       2610.0      3140.0         47   0           2.0     NaN           1993.0    -122115  0.0     8481.0    810000.0  3          10008.0     2610.0       0             0             2015.0   22.0
1      14014     98033    8      650.0          1     1.0        3.0       1560.0      2210.0         47   0           1.0     NaN           1974.0    -122189  0.0     8955.0    685000.0  3          8976.0      2210.0       1             0             2015.0   41.0
2      32909     98005    8      NaN            0     2.0        4.0       2650.0      2230.0         47   0           2.0     NaN           1986.0    -122154  0.0     18295.0   725000.0  3          19856.0     2650.0       0             0             2014.0   28.0
3      16305     98001    7      900.0          0     1.0        5.0       1050.0      1660.0         47   0           1.0     NaN           1962.0    -122289  0.0     8720.0    274000.0  3          8030.0      1950.0       1             0             2015.0   53.0
4      6647      98011    7      320.0          0     2.0        3.0       1310.0      1620.0         47   0           1.0     NaN           1986.0    -122232  0.0     6449.0    445000.0  3          7429.0      1630.0       1             0             2015.0   29.0
5      5865      98040    8      850.0          0     2.0        4.0       1760.0      2550.0         47   0           1.0     NaN           1978.0    -122229  0.0     8760.0    762500.0  4          10376.0     2610.0       1             0             2014.0   36.0
6      8009      98004    8      NaN            1     1.0        3.0       1700.0      2630.0         47   0           1.0     NaN           1954.0    -122     0.0     14133.0   979000.0  4          17376.0     1700.0       0             0             2014.0   60.0
7      4731      98011    8      780.0          0     3.0        5.0       2090.0      2640.0         47   0           2.0     NaN           2007.0    -122192  0.0     4369.0    540000.0  3          4610.0      2870.0       1             0             2014.0   7.0
8      38480     98052    9      NaN            0     2.0        4.0       2700.0      2730.0         47   0           2.0     NaN           2004.0    -122116  0.0     8810.0    690000.0  3          5100.0      2700.0       0             0             2014.0   10.0
9      13246     98072    7      530.0          0     1.0        3.0       1130.0      1260.0         47   0           1.0     NaN           1976.0    -122162  0.0     9673.0    375000.0  3          9681.0      1660.0       1             0             2014.0   38.0

Last rows

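|  | df_index | zipcode | grade | sqft_basement | view | bathrooms | bedrooms | sqft_above | sqft_living15 | lat | waterfront | floors | yr_renovated | yr_built | long | jhygtf | sqft_lot | price | condition | sqft_lot15 | sqft_living | tiene_sotano | fue_renovada | yr_date | antiguedad_venta |
| 16544 | 9302 | 98148 | 7 | NaN | 0 | 1.0 | 2.0 | 940.0 | 1890.0 | 47 | 0 | 1.0 | NaN | 1954.0 | -122329 | 0.0 | 6000.0 | 246500.0 | 2 | 8547.0 | 940.0 | 0 | 0 | 2015.0 | 61.0 |
| 16545 | 26872 | 98008 | 6 | NaN | 0 | 1.0 | 3.0 | 1270.0 | 1210.0 | 47 | 0 | 1.0 | NaN | 1959.0 | -122118 | 0.0 | 8000.0 | 475000.0 | 4 | 7875.0 | 1270.0 | 0 | 0 | 2014.0 | 55.0 |
| 16546 | 49558 | 98198 | 6 | NaN | 2 | 1.0 | 2.0 | 1170.0 | 1380.0 | 47 | 0 | 1.0 | NaN | 1911.0 | -122321 | 0.0 | 8925.0 | 175000.0 | 3 | 7440.0 | 1170.0 | 0 | 0 | 2014.0 | 103.0 |
| 16547 | 146 | 98117 | 6 | 120.0 | 0 | 1.0 | 2.0 | 860.0 | 980.0 | 47 | 0 | 1.0 | NaN | 1918.0 | -122366 | 0.0 | 2130.0 | 400000.0 | 4 | 2800.0 | 980.0 | 1 | 0 | 2014.0 | 96.0 |
| 16548 | 9396 | 98065 | 7 | NaN | 0 | 2.0 | 3.0 | 1950.0 | 2190.0 | 47 | 0 | 2.0 | NaN | 2007.0 | -121869 | 0.0 | 7263.0 | 409000.0 | 3 | 5900.0 | 1950.0 | 0 | 0 | 2014.0 | 7.0 |
| 16549 | 14466 | 98198 | 7 | NaN | 0 | 2.0 | 4.0 | 1780.0 | 1630.0 | 47 | 0 | 2.0 | NaN | 1991.0 | -122302 | 0.0 | 6000.0 | 175000.0 | 3 | 6000.0 | 1780.0 | 0 | 0 | 2014.0 | 23.0 |
| 16550 | 30056 | 98042 | 6 | NaN | 0 | 1.0 | 3.0 | 840.0 | 920.0 | 47 | 0 | 1.0 | NaN | 1969.0 | -122085 | 0.0 | 5525.0 | 191000.0 | 5 | 5330.0 | 840.0 | 0 | 0 | 2015.0 | 46.0 |
| 16551 | 5824 | 98106 | 7 | 550.0 | 0 | 2.0 | 3.0 | 1230.0 | 1780.0 | 47 | 0 | 1.0 | NaN | 1990.0 | -122353 | 0.0 | 6771.0 | 310000.0 | 3 | 6771.0 | 1780.0 | 1 | 0 | 2014.0 | 24.0 |
| 16552 | 16712 | 98038 | 7 | NaN | 0 | 2.0 | 3.0 | 1340.0 | 1060.0 | 47 | 0 | 2.0 | NaN | 1995.0 | -122038 | 0.0 | 3011.0 | 230000.0 | 3 | 3232.0 | 1340.0 | 0 | 0 | 2014.0 | 19.0 |
| 16553 | 237 | 98075 | 10 | NaN | 0 | 2.0 | 3.0 | 3240.0 | 2970.0 | 47 | 0 | 2.0 | NaN | 1994.0 | -122038 | 0.0 | 7857.0 | 800000.0 | 3 | 7857.0 | 3240.0 | 0 | 0 | 2014.0 | 20.0 |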